\documentclass[a4paper, 12pt]{book}
%\documentclass[a4paper, 12pt]{paper}


\author{Daniele Sirigu}
\title{A gesture-based portable camera-projector system for augmenting research artefacts}

\usepackage[english]{babel}
%\usepackage[latin1]{inputenc}
\usepackage[utf8x]{inputenc}
\PrerenderUnicode{àèéìòùöü–}





\usepackage{vmargin}	% needed for page style
\usepackage{amsmath}    % needed for subequations
\usepackage{amsfonts}	% needed for math symbols
\usepackage{graphicx}   % needed for figures
\usepackage{verbatim}   % useful for program listings
\usepackage{color}      % used if color is used in text
\usepackage{subfig}  	% used for side-by-side figures
%\usepackage{hyperref}   % used for hypertext links, including those to external documents and URLs
\usepackage[dvips,colorlinks=true]{hyperref}
\hypersetup{
	bookmarksnumbered=true,
	linkcolor=black,
	citecolor=black,
	pagecolor=black,
	urlcolor=black,
}
\usepackage{makeidx}	% used for making analytical index

\usepackage{fancyvrb}
\DefineVerbatimEnvironment{code}{Verbatim}{fontsize=\small}
\DefineVerbatimEnvironment{example}{Verbatim}{fontsize=\small}


% To make index
\makeindex

\pagestyle{headings}


% Horizontal separation line
\newcommand{\sepline}{\begin{center}\begin{tabular*}{.5\textwidth}{c}\hline\end{tabular*}\end{center}}


% Line Spread
%\linespread{1.4}

% Margins
\evensidemargin=3.35cm
\oddsidemargin=4.75cm
\addtolength{\textwidth}{-0.925in}
%\addtolength{\topmargin}{+.475in}
\addtolength{\textheight}{-0.925in}


% \textit{•} for italic




\begin{document}
\phantomsection
\pdfbookmark[0]{Title}{title}
\DeclareGraphicsExtensions{.pdf,.png,.gif,.jpg}

% FRONTISPIECE
\thispagestyle{empty}

\begin{center}

%	\includegraphics[scale=0.35]{images/dany-img001.png}\\				% UNIVERSITY LOGO
	\begin{figure}[htb]	
		  \vspace*{-0.5in}
		  \hspace{-1.5in}	
		  \hspace{0.95in}  	  	  
		  \subfloat{\includegraphics[width=0.4\textwidth]{images/dany-img001.png}}         
		  \hspace{8em}%	    
		  \subfloat{\includegraphics[width=0.5\textwidth]{images/dany-img002.png}}
	\end{figure}
	\vskip 1.0 cm
	\textsc{Facoltà di Scienze Matematiche, Fisiche e Naturali}			% FACULTY
	\sepline
	\textsc{Corso di Laurea Specialistica in Tecnologie Informatiche}	% COURSE
	\vskip 2 cm
	
	%\begin{sffamily}
	MASTER THESIS													% THESIS
	\vskip 0.5 cm
	\LARGE
	A gesture-based portable camera-projector \\ system for augmenting research artefacts % THESIS TITLE
	\normalsize

	\vskip 3 cm			
	
	
	
	\begin{tabular}{llrrrrrrrrrrrr}
		\large{\textsc{Supervisors}} \\									% SUPERVISOR
		\normalsize
		Dr. Mike FRASER \\
		Dr. Stefano CHESSA \\ \\
		&&&&&&&&&&&&& \textsc{Candidate} \\								% CANDIDATE
		&&&&&&&&&&&&& Daniele SIRIGU
	\end{tabular}
	
	\vskip 2 cm
	\sepline
	Session $V$\\														% SESSION
	Academic Year 2010--2011											% ACADEMIC YEAR			
\end{center}


% Blank page
\newpage
\thispagestyle{empty}
\mbox{}





% DEDICATION
\vspace*{10pc}
\begin{flushright}
	\clearpage
	\phantomsection
	\pdfbookmark[0]{Dedication}{dedication}
	\thispagestyle{empty}
	\textsl{Ai miei genitori}
	\vspace*{10pc}
\end{flushright}
\par\vfill\par





% Blank page
\newpage
\thispagestyle{empty}
\mbox{}




% DECLARATION
\clearpage
\pagenumbering{roman}
\phantomsection
\pdfbookmark[0]{Declaration}{Declaration}
\chapter*{Declaration}
A dissertation submitted to the University of Pisa. Except where specifically acknowledged, it is all the work of the Author. \\ \\
Daniele Sirigu, February 2012




% Blank page
\newpage
\thispagestyle{empty}
\mbox{}




% ACKNOWLEDGMENTS
\clearpage
\phantomsection
\pdfbookmark[0]{Acknowledgements}{acknowledgements}
\chapter*{Acknowledgements}
The research work presented in this thesis was carried out at the Interaction and Graphics laboratory, Merchant Venturers Building, Bristol, as part of an international Erasmus exchange programme between the Universities of Pisa and Bristol.

I would like to express my gratitude to all those who made it possible for me to complete this thesis.

First of all, I would like to thank my project supervisor Dr. Mike Fraser for the opportunity to come to Bristol as a member of the PATINA group and for his continual support and advice throughout this work.

Many thanks also to my home supervisor Dr. Stefano Chessa for his support, advice and for giving me the opportunity to study abroad.

I would like to thank all the members of the PATINA team and of the Bristol Interaction and Graphics group for their friendship and support. In particular, I am sincerely grateful to Dr. Peter Bennet for all his advice, especially during the last period of my stay.

As an Erasmus student, I would like to extend my appreciation to all of the staff and students at the University of Bristol for welcoming me into their group and for their friendship throughout the course of this work.

Special thanks go to my colleagues at the University of Pisa, who have also become very close friends over these years, and in particular to Nicola Desogus for his help in revising this thesis.

I would like to give a big thank you to my brother, my family, my closest friends, my flatmates and all the friends I made during my stay in the UK for their friendship and for their help with anything I needed.

Last, but most importantly, I would like to thank my parents for their continual support throughout my undergraduate and postgraduate years of study and for the sacrifices that made this thesis possible.



% Blank page
\newpage
\thispagestyle{empty}
\mbox{}





% ABSTRACT
\clearpage
\phantomsection
\pdfbookmark[0]{Abstract}{abstract}
\chapter*{\centering Abstract} 
In this thesis we present a portable camera-projector system controlled by a gestural human-computer interface. The system identifies research artefacts and projects related information directly back into the research space. Our main assumption is that the objects carry text on their surface and that they are in the device's field of view.

Traditional camera-projector systems for text recognition employ only standard RGB cameras. These types of devices can be employed for a variety of applications, but little attention has been paid to their use in recognising text in natural scene images.
Previous work has centred on locating text in whole images, without any real interaction and requiring good resolution across the whole image. The novelty of our approach lies in using a depth camera to assist the system in recognising artefacts of interest, making text tracking interactive and thereby reducing the resolution demands.

We present design concepts, a prototype implementation and experimental results that show the validity of our approach. Furthermore, in order to create a portable device, we undertake a preliminary investigation into the feasibility of jointly employing depth cameras with small-sized computer units, which to the best of our knowledge has never been done before.



% Blank page
\newpage
\thispagestyle{empty}
\mbox{}






% TABLE OF CONTENTS
\clearpage
\phantomsection
\pdfbookmark[0]{Table of Contents}{tableofcontents}
\tableofcontents
\thispagestyle{empty}
\clearpage



\pagenumbering{arabic}


% CHAPTERS
\input{chapters/01_introduction}
\input{chapters/02_background}
\input{chapters/03_system_design}
\input{chapters/04_comparison_and_analysis}
\input{chapters/05_software_implementation}
\input{chapters/06_results}
\input{chapters/07_conclusions}



% Blank page
\newpage
\thispagestyle{empty}
\mbox{}

\appendix
\include{chapters/appendix}
\include{chapters/appendixB}
\include{chapters/appendixC}

% Blank page
\newpage
\thispagestyle{empty}
\mbox{}


% LIST OF FIGURES
\clearpage
\phantomsection
\pdfbookmark[0]{List of Figures}{listoffigures}
\listoffigures
\thispagestyle{empty}
\clearpage



% Blank page
\newpage
\thispagestyle{empty}
\mbox{}


% BIBLIOGRAPHY
\clearpage
\phantomsection
\pdfbookmark[0]{Bibliography}{bibliography}
\begin{thebibliography}{13}


\bibitem{1}{http://patina.ac.uk/}

\bibitem{2}{Brian Curless. \textit{From Range Scans to 3D Models}. ACM SIGGRAPH Computer Graphics 33 (4), pp. 38-41, November 2000.}

\bibitem{3}{Medina A, Gayá F, and Pozo F. \textit{Compact laser radar and three-dimensional camera}. J. Opt. Soc. Am. A 23 (2006), pp. 800–805.}

\bibitem{4}{J. Underkoffler and H. Ishii. \textit{Illuminating Light: An Optical Design Tool with a Luminous-Tangible Interface}. CHI ’98 Making the Impossible Possible, Addison-Wesley/ACM Press, pp. 542-549, 1998.}

\bibitem{5}{Cauchard, J., Fraser, M., Alexander, J. and Subramanian, S.. \textit{Offsetting Displays on Mobile Projector Phones}. Proc. First International Workshop on Personal Projection (UbiProjection 2010), pp. 12-14, Helsinki, Finland, 2010.}

\bibitem{6}{M. Baldauf and P. Fröhlich. \textit{Supporting Hand Gesture Manipulation of Projected Content with Mobile Phones}. Proceedings of the Workshop on Mobile Interaction with the Real World (MIRW), 2009.}

\bibitem{7}{Danny Crasto, Amit Kale and Christopher Jaynes. \textit{The Smart Bookshelf: A study of camera projector scene augmentation of an everyday environment}. WACV05 (I:218-225).}

\bibitem{8}{David M. Chen, Sam S. Tsai, Bernd Girod, Cheng-Hsin Hsu, Kyu-Han Kim, Jatinder Pal Singh. \textit{Building book inventories using smartphones}. In Alberto Del Bimbo, Shih-Fu Chang, Arnold W. M. Smeulders, editors, Proceedings of the 18th International Conference on Multimedia 2010, Firenze, Italy, October 25-29, 2010, pp. 651-654, ACM, 2010.}

\bibitem{9}{Markus Löchtefeld, Sven Gehring, Johannes Schöning, Antonio Krüger. \textit{ShelfTorchlight: Augmenting a Shelf using a Camera Projector Unit}. Adjunct Proceedings of the Eighth International Conference on Pervasive Computing, Helsinki, Finland, Springer Lecture Notes in Computer Science, 2010.}

\bibitem{10}{Merino, C., Mirmehdi, M. \textit{A Framework Towards Realtime Detection and Tracking of Text}. In: Second International Workshop on Camera-Based Document Analysis and Recognition (CBDAR 2007), pp. 10–17, Sept 2007.}

\bibitem{11}{Jordi Pagès, Joaquim Salvi, Josep Forest. \textit{New Optimised De Bruijn Coding Strategy for Structured Light Patterns}. ICPR (4) 2004: 284-287.}

\bibitem{12}{Brian Curless. \textit{From Range Scans to 3D Models}. ACM SIGGRAPH Computer Graphics 33 (4): 38–41. doi:10.1145/345370.345399 (November 2000).}

\bibitem{13}{Medina A, Gayá F, and Pozo F. \textit{Compact laser radar and three-dimensional camera}. J. Opt. Soc. Am. A 23 (2006), pp. 800–805.}

\bibitem{14}{D. Comelli, G. Valentini, R. Cubeddu, L. Toniolo. \textit{Fluorescence Lifetime Imaging and Spectroscopy}. Handbook on the Use of Lasers in Conservation and Conservation Science. Eds. Manfred Schreiner and Matija Strlič, COST G7 (2007) Ch. 4.4 pp.1-15. ISBN-13-978-973-88109-3-8}

\bibitem{15}{David Pogue. \textit{Shazam! A Projector Is Shrunk}. The New York Times, published November 4, 2008.}

\bibitem{16}{AIM. \textit{Optical Character Recognition (OCR)}. published by: AIM, Inc. 634 Alpha Drive Pittsburgh, PA 15238-2802, US.}

\bibitem{17}{Ming-Hsuan Yang. \textit{Object recognition}. Encyclopedia of Database Systems (eds. L. Liu and M. T. Ozsu), pp. 1936-1939, 2009.}

\bibitem{18}{P. Viola. \textit{Feature based recognition of objects}. In Proceedings of the AAAI Fall Symposium Series: Machine Learning in Computer Vision: What, Why and How?, 1993, pp. 60-64.}

\bibitem{19}{Filip Krolupper, Jan Flusser. \textit{Polygonal shape description for recognition of partially occluded objects}. Pattern Recognition Letters 28(9): 1002-1011 (2007).}

\bibitem{20}{G. Bieszczad and T. Sosnowski. \textit{Real-time mean-shift based tracker for thermal vision systems}.}

\bibitem{21}{A. Hang, E. Rukzio, and A. Greaves. \textit{Projector Phone: A Study of Using Mobile Phones with Integrated Projector for Interaction with Maps}. Proceedings of the Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI), 2008.}

\bibitem{22}{Brennesholtz, M (2008). \textit{Market Segment Analysis: Pico-Projectors}. Insight Media.}

\bibitem{23}{Schuon, Sebastian; Theobalt, Christian; Davis, James; Thrun, Sebastian (2008-07-15). \textit{High-quality scanning using time-of-flight depth superresolution}. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2008, Anchorage, Alaska. Institute of Electrical and Electronics Engineers. pp. 1–7.}

\bibitem{24}{PMD[vision] CamCube 2.0 Datasheet No. 20090601 ed. Siegen, Germany: PMDTechnologies. 2009-06-01.  p. 5. Retrieved 2009-07-31. \textit{Type of Sensor: PhotonICs PMD 41k-S (204 x 204)}.}

\bibitem{25}{Fringe 2005, The 5th International Workshop on Automatic Processing of Fringe Patterns Berlin: Springer, 2006. ISBN 3-540-26037-4 ISBN 978-3-540-26037-0}

\bibitem{26}{David Fofi, Tadeusz Sliwa, Yvon Voisin. \textit{A Comparative Survey on Invisible Structured Light}. SPIE Electronic Imaging - Machine Vision Applications in Industrial Inspection XII, San José, USA, pp. 90-97, January 2004.}

\bibitem{27}{Raj. Kamal. \textit{Embedded systems: architecture, programming and design}. Tata McGraw-Hill Education, 2008.}

\bibitem{28}{www.primesense.com/}

\bibitem{29}{http://www.openni.org/}

\bibitem{30}{http://www.nokia.co.uk/support/product-support/n900/specifications}

\bibitem{31}{http://maemo.org/}

\bibitem{32}{www.scratchbox.org}

\bibitem{33}{http://www.usb.org/developers/onthego/}

\bibitem{34}{ \textit{Maemo developer mailing list}. Nokia R\&D. \\ http://lists.maemo.org/pipermail/maemo-developers}

\bibitem{35}{http://wiki.maemo.org/N900\_Hardware\_USB}

\bibitem{36}{http://wiki.maemo.org/Kernel\_Power}

\bibitem{37}{http://wiki.forum.nokia.com/index.php/Maemo\_5\_SDK\_installation\_for\_beginners}

\bibitem{38}{HP dv5-1110el product specifications:\\ http://www1.hp.com/}

\bibitem{39}{www.kinecthacks.net} 

\bibitem{40}{Compaq; Hewlett-Packard, Intel, Lucent, Microsoft, NEC, Philips (27 April 2000). \textit{Universal Serial Bus Specification Revision 2.0}. Retrieved 16 June 2010.}

\bibitem{41}{Jan Axelson. \textit{USB complete: everything you need to develop custom USB peripherals}. 3rd edition, Lakeview Research LLC, 2005.}

\bibitem{42}{Ajay Kumar Gupta. \textit{musb: add high bandwidth ISO support}.  03/04/2009: \\ 
http://git.kernel.org/}

\bibitem{43}{www.openkinect.org}

\bibitem{44}{http://maemo.org/packages/view/kernel-power-settings/}

\bibitem{45}{Ajay Kumar Gupta. \textit{musb: Add workqueue for URB giveback}. 25/11/2010: \\
http://arago-project.org/}

\bibitem{46}{www.meego.com}

\bibitem{47}{https://github.com/avin2/SensorKinect}

\bibitem{48}{https://github.com/avin2/SensorKinect}

\bibitem{49}{http://www.xbitlabs.com/articles/cpu/display/prescott\_10.html}

\bibitem{50}{http://www.columbia.edu/kermit/}

\bibitem{51}{http://www.myviliv.com/eng/product/n5.asp}

\bibitem{52}{Texas Instruments DLP Pico Projector Development Kit Version 2.0 specifications \\ http://focus.ti.com/docs/toolsw/folders/print/dlp1picokit.html}








\end{thebibliography}
%\bibliographystyle{alpha}
%\bibliographystyle{plain}
%\bibliography{bibliography}
%\clearpage
%\thispagestyle{empty}
%\printindex
%\clearpage
%\thispagestyle{empty}
\end{document}